Picture for Guohui Zhang

Guohui Zhang

OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation

Add code
May 12, 2026
Viaarxiv icon

Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation

Add code
May 05, 2026
Viaarxiv icon

Thinking in Structures: Evaluating Spatial Intelligence through Reasoning on Constrained Manifolds

Add code
Feb 08, 2026
Viaarxiv icon

MaskFocus: Focusing Policy Optimization on Critical Steps for Masked Image Generation

Add code
Dec 21, 2025
Viaarxiv icon

Group Critical-token Policy Optimization for Autoregressive Image Generation

Add code
Sep 26, 2025
Viaarxiv icon

Towards Human-Like Trajectory Prediction for Autonomous Driving: A Behavior-Centric Approach

Add code
May 27, 2025
Figure 1 for Towards Human-Like Trajectory Prediction for Autonomous Driving: A Behavior-Centric Approach
Figure 2 for Towards Human-Like Trajectory Prediction for Autonomous Driving: A Behavior-Centric Approach
Figure 3 for Towards Human-Like Trajectory Prediction for Autonomous Driving: A Behavior-Centric Approach
Figure 4 for Towards Human-Like Trajectory Prediction for Autonomous Driving: A Behavior-Centric Approach
Viaarxiv icon

DEMO: A Dynamics-Enhanced Learning Model for Multi-Horizon Trajectory Prediction in Autonomous Vehicles

Add code
Dec 30, 2024
Figure 1 for DEMO: A Dynamics-Enhanced Learning Model for Multi-Horizon Trajectory Prediction in Autonomous Vehicles
Figure 2 for DEMO: A Dynamics-Enhanced Learning Model for Multi-Horizon Trajectory Prediction in Autonomous Vehicles
Figure 3 for DEMO: A Dynamics-Enhanced Learning Model for Multi-Horizon Trajectory Prediction in Autonomous Vehicles
Figure 4 for DEMO: A Dynamics-Enhanced Learning Model for Multi-Horizon Trajectory Prediction in Autonomous Vehicles
Viaarxiv icon

Real-time Accident Anticipation for Autonomous Driving Through Monocular Depth-Enhanced 3D Modeling

Add code
Sep 02, 2024
Figure 1 for Real-time Accident Anticipation for Autonomous Driving Through Monocular Depth-Enhanced 3D Modeling
Figure 2 for Real-time Accident Anticipation for Autonomous Driving Through Monocular Depth-Enhanced 3D Modeling
Figure 3 for Real-time Accident Anticipation for Autonomous Driving Through Monocular Depth-Enhanced 3D Modeling
Figure 4 for Real-time Accident Anticipation for Autonomous Driving Through Monocular Depth-Enhanced 3D Modeling
Viaarxiv icon

World Models for Autonomous Driving: An Initial Survey

Add code
Mar 05, 2024
Figure 1 for World Models for Autonomous Driving: An Initial Survey
Figure 2 for World Models for Autonomous Driving: An Initial Survey
Figure 3 for World Models for Autonomous Driving: An Initial Survey
Figure 4 for World Models for Autonomous Driving: An Initial Survey
Viaarxiv icon

A Deep Reinforcement Learning Approach for Traffic Signal Control Optimization

Add code
Jul 13, 2021
Figure 1 for A Deep Reinforcement Learning Approach for Traffic Signal Control Optimization
Figure 2 for A Deep Reinforcement Learning Approach for Traffic Signal Control Optimization
Figure 3 for A Deep Reinforcement Learning Approach for Traffic Signal Control Optimization
Figure 4 for A Deep Reinforcement Learning Approach for Traffic Signal Control Optimization
Viaarxiv icon